Sound reproduction system, program and data carrier



The invention relates to a sound reproduction system comprising: 
- a sound production means;

- an audio processor with a filter arranged for applying a first head related
transfer function (HRTF) to an input audio signal from an audio signal source, and yielding
an output audio signal for the sound production means; and

- a first data source, which is arranged for delivering first filter coefficients of
the first head related transfer function to the filter.

The invention also relates to a computer program for execution by a processor, 
describing a method of sound reproduction comprising the steps: 
- obtaining coefficients of a first head related transfer function filter from a first
data source;

- applying a first head related transfer function filtering to an input audio signal
from an audio signal source, yielding an output audio signal.

The invention also relates to a data carrier storing a computer program for
execution by a processor, describing a method of sound reproduction comprising the steps:
- obtaining coefficients of a first head related transfer function filter from a first
data source;

- applying a first head related transfer function filtering to an input audio signal
from an audio signal source, yielding an output audio signal.

The invention also relates to a data carrier storing a first head related transfer
function over a first predetermined frequency range.

The invention also relates to a signal transmission system transmitting a first
head related transfer function over a first predetermined frequency range.

Such a sound reproduction system is known from WO-A-0149066. The
preferred embodiment of the known system consists of a pair of headphones, arranged to
simulate any sound source around a user of the headphones, which is called a headphone
virtualizer. The headphones can simulate e.g. the loudspeakers required for Dolby Digital 5.1,
i.e. a left, a center and a right loudspeaker, at least one left surround and one right surround
speaker, and if required a speaker for the low frequency effects; hence they obviate the need
to use all these loudspeakers. Furthermore, a true three-dimensional sound field can be
simulated, as e.g. in the ambiophonics approach. Any room acoustics can also be simulated.
To present to the user's ear the same sound from a nearby headphone
loudspeaker as would be presented by a faraway loudspeaker driven by an input audio signal,
the input audio signal has to be filtered by a head related transfer function, yielding an output
audio signal to be input to the headphone loudspeaker. The HRTF characterizes the
transmission of the sound from the faraway loudspeaker to an ear, after e.g. reflecting off
walls, interacting with the torso and pinna of the user, etc. Generally, virtualizers use a
standard head related transfer function, e.g. for an average human. The drawback of this
approach is that the user has learned to localize a sound with his own particular body and
pinna shape, and hence with an incorrect HRTF sounds are perceived as coming from inside
the head, or the front and back directions are confused. The known system avoids this
drawback by incorporating microphones in the headphones, which measure the sound from a
faraway loudspeaker as it enters the ear. On the basis of these measurements, an HRTF for the
particular user is obtained.

It is a disadvantage of the known system that the measured HRTF has
inaccurate values for some frequency regions, leading to inaccurate positioning of a virtual
sound source.



It is a first object of the invention to provide a sound reproduction system of
the kind described in the opening paragraph, which is relatively accurate.

It is a second object to provide a computer program according to the invention
as described in the opening paragraph.

It is a third object of the invention to provide a data carrier storing a computer
program according to the invention.

It is a fourth object of the invention to provide a data carrier storing two
complementary HRTFs.

It is a fifth object of the invention to provide a signal transmission system 
transmitting two complementary HRTFs. 

The first object is realized in that a second data source is present, which is able
to deliver second filter coefficients of a second head related transfer function to the filter for
filtering the input audio signal yielding the output audio signal. The filter can apply the first
HRTF over a first frequency range where the coefficients of the first HRTF are accurate, and
apply the second HRTF over a second frequency range, e.g. the frequency range where the
coefficients of the first HRTF are inaccurate. The coefficients of the second HRTF can then
be determined with a more reliable method, e.g. a measurement in a dedicated laboratory.

An additional advantage of using two or more HRTFs is that desirable effects
can be implemented more easily, such as changing the acoustics of a virtual room.
Adding a virtual carpet to the room, for example, can be modeled by changing an HRTF in a
range of higher frequencies, for which the carpet is highly absorbing.
The first, second and if required further frequency ranges can also be predetermined to be
overlapping instead of disjunct, e.g. in an application which selects a frequency range for
easy postprocessing.

It is advantageous if a microphone is present for performing a sound
measurement and the first data source comprises coefficient calculation means for calculating
the first filter coefficients from the sound measurement, and the second data source
comprises a memory for storing data related to the second head related transfer function. The
memory can then contain e.g. the coefficients of a standard HRTF for the higher frequencies,
which was measured in a laboratory.

Additionally, it is also advantageous if the second data source comprises
calculating means for calculating coefficients of the second head related transfer function
filter based on data from the memory. The calculating means can then e.g. derive a
parametric HRTF for the higher frequencies, e.g. based on measurements of the geometrical
or audio characteristics of the ear of the user. Required parameters and formulae can be
stored in the memory.

The second and third objects are realized in that the method of the computer
program according to the invention, stored on the data carrier according to the invention,
comprises further steps of

- obtaining coefficients of a second head related transfer function from a second
data source; and

- applying a second head related transfer function filtering to an input audio
signal from an audio signal source, yielding an output audio signal.

The fourth and fifth objects are realized in that

- also a second head related transfer function over a second predetermined
frequency range (F2) is stored or transmitted, respectively, and
- the second head related transfer function comprises complementary
information, improving the simulation of sound from a loudspeaker by means of sound
production means.

These and other aspects of the sound reproduction system, the computer
program and the data carrier according to the invention will be apparent from and elucidated
with reference to the implementations and embodiments described hereinafter, and with
reference to the accompanying drawings, which serve merely as non-limiting illustrations.
In the drawings:

Fig. 1 schematically shows a sound reproduction system according to the
invention;

Fig. 2 schematically shows a head related transfer function (HRTF); and
Fig. 3 shows an HRTF measurement facility.
In these Figures elements drawn dashed are optional, depending on the desired
embodiment.



Fig. 1 shows an audio signal source 110, e.g. a radio or a DVD player, which
sends an input audio signal 106 to a loudspeaker 120 for sound reproduction. Also shown is a
pair of headphones, comprising headphone loudspeakers 101 and 103 to produce
substantially the same sound from the input audio signal 106 at an ear of a user 155 as the
loudspeaker 120. To achieve this an audio processor 105 is present, which comprises a filter
130 for filtering input signal 106 yielding output signal 108 for the headphone loudspeakers
101 and 103. Additionally, a first data source 131 is present for delivering first filter
coefficients of a first head related transfer function 200 (shown in Fig. 2) to the filter 130,
and a second data source 100 is present for delivering second filter coefficients of a second
head related transfer function, e.g. 202, to the filter 130. The second data source 100 may
comprise a memory 150. In this memory 150 a number of HRTFs can be stored, e.g. as
measured in a laboratory for a number of test persons. Memory 150 can also store parameters
and formulae required by calculating means 152 for modeling an HRTF. Conversely,
coefficients resulting from modeling by calculating means 152 can be stored in memory 150
for later use, instead of being sent to filter 130 directly. A data loading means 180 may be
present for loading data into the memory, e.g. by attaching a drive for a data carrier 300, or a
connection to the internet. The data may also be preloaded in memory 150 in the factory.
User 155 can e.g. select one of the HRTFs stored in memory 150, for which the values in the
frequency range higher than 12 kHz are most desirable, e.g. because they yield optimal sound
localization.

Loudspeaker 120 has to be interpreted as a predetermined sound source
depending on the application, such as:

- a left surround loudspeaker in a home cinema application;

- a sound generated by a virtual character of a computer game or virtual reality
application in a certain position in a virtual space; or

- a sound from a particular direction associated with one of a multitude of
communication channels in a speech communication application, such as applied in a
command center.

The sound travels to the ear of a user 155 via a direct path 160, but also via
indirect paths, such as reflecting path 161, which reflects from an object 140. Objects can
also absorb and scatter sound, and sound can also be transmitted, e.g. by a window. The
sound also interacts with various body parts of the user 155, before it finally enters his ear.
As a result, the relative power at a particular sound frequency as measured by
the ear is increased or decreased compared to the power emitted by loudspeaker 120, which
can be modeled by a filter function. This filter function is called a head related transfer
function (HRTF), which is user-dependent, room-dependent, dependent on the direction of a
sound source like loudspeaker 120, and dependent on the position of user 155 and
loudspeaker 120 in the room. It is possible to simulate the sound produced by loudspeaker
120 as perceived by the ear of user 155, by applying substantially the same sound field to the
ear by means of a headphone loudspeaker 103, or in general another loudspeaker in the room.
The input audio signal 106 cannot be applied directly to the headphone loudspeaker 103, for
then the user 155 would perceive the sound as coming from the position of the headphone
loudspeaker 103, instead of as coming from the position of loudspeaker 120. In order to
achieve a convincing simulation of loudspeaker 120, the input audio signal has to be filtered by
an audio processor 105, in which filter 130 applies a first HRTF, which is substantially the
true HRTF 200 for the particular user 155, position, and so on. In general headphone
virtualizers use a fixed HRTF, or an HRTF selected from a number of stored HRTFs
measured in advance for different people. If the HRTF used is not the one corresponding to
the particular user 155, accurate sound localization in general does not occur. E.g. a sound source
in the back can be erroneously heard in front, the elevation of the sound can be misjudged, or
the sound might be perceived as originating inside the head. To obtain the true HRTF 200, a
left microphone 113 is present for measuring sound e.g. as it enters the ear of user 155, and
similarly to obtain the HRTF from loudspeaker 120 to the other ear, a right microphone 111
is present. WO-A-0149066 describes an algorithm for obtaining the HRTFs from
measurements by microphones 111 and 113. A loudspeaker positioned in an environment of
user 155, like loudspeaker 120, is used to generate sound to be measured by microphones 111
and 113. After the measurement one can dispose of loudspeaker 120, since the resulting
measurement data can be stored in memory, and used for obtaining HRTF coefficients when
necessary. Coefficient calculation means 132 applies an algorithm for obtaining HRTF
coefficients from microphone measurements.

There are a number of applications which benefit from the use of a second
HRTF.

Firstly, the true HRTF 200 cannot be obtained accurately from the
microphone measurements for all frequencies. For high frequencies, e.g. over 9 kHz, the
position of the microphones 111 and 113 is very critical, and hence the obtained values of the
HRTF for high frequencies are inaccurate, which could again result in poor localization of
simulated sound sources. In a consumer application it is desirable that the microphone does
not enter the ear canal, which results in reduced accuracy of the HRTF coefficients for high
frequencies. Furthermore, if the HRTF measurement is performed by means of noise
cancellation, obtaining accurate HRTF coefficients for high frequencies implies that the user
should sit very still. For low frequencies, there is often environmental noise present; however,
the algorithms for obtaining the HRTF have no way of deciding whether this noise originated
from the loudspeaker 120. This can lead to inaccurate HRTF values in the low frequency
region, e.g. below 200 Hz. More reliable HRTF values for a predetermined second frequency
range F2 can be obtained from a source other than the in situ microphone measurement used for
obtaining the HRTF in an unequal predetermined first frequency range F1, e.g. from a
dedicated laboratory measurement. These more reliable values are used in the second HRTF.
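
By way of non-limiting illustration, the selection of HRTF values per frequency range can be sketched as follows. This is a minimal sketch under stated assumptions, not the implementation of the system: the function name `splice_hrtf`, the representation of each HRTF as a list of gains per frequency bin, and the range limits of 200 Hz and 9 kHz (taken from the examples mentioned above) are all chosen for illustration only.

```python
def splice_hrtf(freqs_hz, measured, reference, f_low=200.0, f_high=9000.0):
    """Combine two HRTF magnitude tables bin by bin.

    Inside [f_low, f_high] (assumed first frequency range F1) the in situ
    measured value is used; outside it (assumed second range F2) the value
    from the more reliable reference source is used.
    """
    if not (len(freqs_hz) == len(measured) == len(reference)):
        raise ValueError("all inputs must have the same length")
    return [
        m if f_low <= f <= f_high else r
        for f, m, r in zip(freqs_hz, measured, reference)
    ]


# Hypothetical example data: four frequency bins.
freqs = [100.0, 500.0, 4000.0, 12000.0]
measured = [0.2, 1.1, 1.8, 0.1]    # in situ values, unreliable at both ends
reference = [1.0, 1.0, 1.5, 0.6]   # e.g. laboratory values
combined = splice_hrtf(freqs, measured, reference)
print(combined)  # [1.0, 1.1, 1.8, 0.6]
```

A hard switch between the ranges is the simplest choice; overlapping ranges with a gradual weighting would be an equally valid alternative.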

Secondly, in case a user 155 wants to change a certain part of an HRTF, e.g. to
make a simulated virtual room sound more like an opera house or the outdoors than like
the room he is present in, it is desirable to realize this with as few operations as possible, and
without any additional microphone measurements. This is facilitated by applying several
HRTFs.

Thirdly, the choice of possible HRTF filter structures is diversified. Typically a
finite impulse response (FIR) filter is used for realizing the HRTF. However, for the low
frequencies, sound keeps reflecting off the walls for a long time, necessitating an FIR filter
with many coefficients. It is advantageous to realize the filtering of the lower frequencies
with an infinite impulse response (IIR) filter. This is achieved e.g. when the filter 130 is
realized as a first HRTF filter, which filters the higher frequencies and leaves the low
frequencies unchanged, and a second HRTF filter, which is realized as an IIR filter, filtering
only the low frequencies. The filters can be disjunct means, or can be realized sequentially on
a processor.
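
The sequential realization of an FIR stage followed by a recursive (IIR) stage can be sketched as below. This is an illustrative sketch only: the tap values and the feedback coefficient are placeholders, the two toy stages are not band-limited as the stages of a real HRTF filter would be, and the function names are hypothetical. The sketch merely shows that a single feedback coefficient yields a decaying response of unlimited length, which an FIR filter could only approximate with many taps.

```python
def fir_filter(samples, taps):
    """Direct-form FIR: y[n] = sum over k of taps[k] * x[n - k]."""
    out = []
    for n in range(len(samples)):
        acc = 0.0
        for k, b in enumerate(taps):
            if n - k >= 0:
                acc += b * samples[n - k]
        out.append(acc)
    return out


def iir_one_pole(samples, feedback):
    """Recursive stage: y[n] = x[n] + feedback * y[n - 1]."""
    out, previous = [], 0.0
    for x in samples:
        previous = x + feedback * previous
        out.append(previous)
    return out


impulse = [1.0] + [0.0] * 7
stage1 = fir_filter(impulse, [0.5, 0.5])   # placeholder FIR taps
response = iir_one_pole(stage1, 0.5)       # placeholder feedback coefficient
print(response[:4])  # [0.5, 0.75, 0.375, 0.1875]
```

The response never reaches exactly zero after the input has ended, although only one recursive coefficient is stored.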

Fig. 2 schematically shows a true HRTF 200 (solid line) as it can be
measured for a particular user 155, for a loudspeaker 120 e.g. straight in front of the user 155,
and at a certain elevation. An amplitude A of an audio signal as measured by the microphone
113 positioned e.g. near the entrance of an ear is shown for different frequencies f. The effect
of environmental objects, such as the ear pinna, on sound traveling from loudspeaker 120 to
the position of the microphone 113 is then characterized. This effect is different when the
loudspeaker producing the sound is e.g. headphone loudspeaker 103, which is closer to the
microphone 113, and of which the sound does not interact with e.g. a wall in the environment
of user 155. The low frequency behavior is determined amongst others by the room and the
torso of the user 155. At around 4 kHz there is a peak due to positive interference in the ear
canal. Somewhere around a notch frequency, e.g. between 6 and 12 kHz, there is a
destructive interference due to reflections on the pinna, called the "pinna notch". In the
schematic true HRTF 200, in a second frequency range F2 higher than e.g. 12 kHz, the
microphone measurement is inaccurate. A simple embodiment of the sound reproduction
system according to the invention specifies e.g. the first HRTF to be the true HRTF 200 in a
first frequency range F1 up to 12 kHz, and uses e.g. a modeled HRTF 202 (dashed line) as the
second HRTF for the frequencies in a second frequency range F2 over 12 kHz. In this second
frequency range F2 the sound modification can be modeled e.g. by a head shadow model, for
example:

$$\mathrm{ITD} = \frac{r}{c}\,[\theta + \sin\theta], \quad \text{and}$$

$$\mathrm{IID} = 1 + [\ldots]\,\sin\theta$$

in which ITD is an interaural time difference, r the radius of the head of user 155 or an
average user, c the speed of sound, θ an azimuthal direction of a sound source, IID an
interaural intensity difference, and f a frequency. If the first HRTF is substantially the true
HRTF 200 over a sufficiently large frequency range, and the second HRTF comprises
reasonably correct coefficients, good sound localization occurs. Other models can be applied,
e.g. a Karhunen-Loève expansion from simpler HRTFs or HRTFs measured e.g. in a
laboratory. Outer ear and ear canal geometrical or sound filtering parameters can be used to
obtain a desired HRTF model, the parameters being obtained e.g. from measurements by
microphones 111 and 113 of sound emitted by headphone loudspeakers 101 and 103.
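
As a worked illustration of the head shadow model, the interaural time difference can be evaluated numerically. The sketch assumes that the ITD expression has the form ITD = (r/c)(θ + sin θ), with θ in radians; the default head radius of 0.0875 m and speed of sound of 343 m/s are assumed typical values, not values given in this description, and the function name is hypothetical.

```python
import math


def itd_seconds(azimuth_rad, head_radius_m=0.0875, speed_of_sound=343.0):
    """Interaural time difference, assuming ITD = (r / c) * (theta + sin(theta)).

    head_radius_m and speed_of_sound are assumed default values.
    """
    return (head_radius_m / speed_of_sound) * (azimuth_rad + math.sin(azimuth_rad))


itd_front = itd_seconds(0.0)          # source straight ahead
itd_side = itd_seconds(math.pi / 2)   # source at 90 degrees to the side
print(itd_front, itd_side)
```

With these assumed values a source straight ahead gives no time difference, and a source fully to one side gives roughly 0.66 ms.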
Another embodiment comprises modeling of the pinna notch. The notch frequency
shifts with the elevation of a sound source. In a gaming application, for example, a flying sound
source can be simulated as follows. In this embodiment, the first HRTF is e.g. equal to the
true HRTF 200 over the entire frequency range, except that between 6 and 12 kHz no
filtering occurs. The second HRTF applies a notch with a notch frequency dependent on
the height of the flying sound source in a frequency range e.g. between 6 and 12 kHz. Head
tracking means might also be present to obtain head position parameters for obtaining
modeled HRTF 202. Another example of a functionality that can be realized with the sound
reproduction system is on-the-fly simulation of room changes. E.g. putting a virtual carpet on
the floor, which increases absorption of higher frequencies, can be modeled by replacing
modeled HRTF 202 by room feature modeling HRTF 204 (dotted line). Calculating means
152 perform the necessary modeling calculations to obtain the correct values for the first
and/or the second HRTF. Care should be taken that the amplitude levels of the first and the
second HRTF are tuned to each other, and in case of switching to a different second HRTF,
artifacts like audible clicks should be avoided by applying a transition strategy.
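
The description does not prescribe a particular transition strategy. One common option, shown here purely as an assumption, is a linear crossfade between the output obtained with the previous second HRTF and the output obtained with the new one. The function name `crossfade` and the fade length are hypothetical.

```python
def crossfade(old_output, new_output, fade_len):
    """Blend linearly from old_output to new_output over fade_len samples.

    After fade_len samples the result follows new_output exactly, so there
    is no abrupt jump that could be heard as a click.
    """
    if len(old_output) != len(new_output):
        raise ValueError("both signals must have the same length")
    blended = []
    for n, (old, new) in enumerate(zip(old_output, new_output)):
        gain = min(1.0, n / fade_len) if fade_len > 0 else 1.0
        blended.append((1.0 - gain) * old + gain * new)
    return blended


# Hypothetical signals: output filtered with the old and the new second HRTF.
old_signal = [1.0] * 6
new_signal = [0.0] * 6
faded = crossfade(old_signal, new_signal, fade_len=4)
print(faded)  # [1.0, 0.75, 0.5, 0.25, 0.0, 0.0]
```

In practice both filters would run in parallel for the duration of the fade, after which the old one can be discarded.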

It should be clear that the proposed implementations of the HRTFs can be
realized in the time domain as well as in the frequency domain. Instead of applying the sound
field corresponding to a virtual source in the environment to the ears of user 155 by means of
headphone speakers 101 and 103, speakers positioned in the room like loudspeaker 120 can
be used, as is described in WO-A-0149066. The audio processor 105 or any of its
constituents can be realized as a separate entity, or can form part of the audio signal source
110 or the headphones.

Fig. 3 shows a facility for HRTF measurement. Such a professional facility
can be present e.g. in a dedicated music shop. Instead of a cheap microphone 111
incorporated in headphones, a more professional microphone 302 may be present, which e.g.
measures deeper in the ear canal of user 310. Furthermore, professional loudspeakers 305,
306 may be present, as well as professional acoustic wall covering. Audio processing
apparatus 301 may perform further calculations on the data received from the professional
microphone 302, such as e.g. simulating extra room reverberations to model a concert hall.
These calculations can be performed directly on the professional microphone 302 signal or
on the HRTF obtained therefrom. Data for the second HRTF can e.g. be downloaded from a
database on the Internet to which only the shopkeeper has access, via a signal transmission
system 304. The two HRTFs are then e.g. stored on data carrier 303, which can be e.g. an
optical disk or flash memory in an MP3 player of user 310. Alternatively, the HRTFs can be
transmitted over signal transmission system 304, e.g. the Internet or a wireless connection to
e.g. a portable device. The second frequency range F2 is so chosen that the second head
related transfer function 202, 204 comprises information complementary to that of the first
head related transfer function 200, so that when both HRTFs are used together by filter 130,
the simulation of sound from a loudspeaker 120 by means of sound production means 101 is
improved, as far as e.g. sound localization is concerned, compared to when using either
HRTF on its own. Both HRTFs can be combined into a single HRTF before storage or
transmission. Another possibility is that a number of additional HRTFs are stored over the
second frequency interval, e.g. a number of pinna notch HRTFs to quickly simulate the
vertical position of a sound source.
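
A stored set of pinna notch HRTFs can be illustrated with a small lookup. The sketch below is hypothetical throughout: the description states only that the notch frequency shifts with elevation within e.g. 6 to 12 kHz, so the linear mapping, the elevation limits of -45 and +45 degrees, and the function names are assumptions made for the example.

```python
def notch_frequency_hz(elevation_deg, elev_min=-45.0, elev_max=45.0,
                       f_min=6000.0, f_max=12000.0):
    """Map an elevation to a notch frequency (assumed linear mapping)."""
    clamped = max(elev_min, min(elev_max, elevation_deg))
    fraction = (clamped - elev_min) / (elev_max - elev_min)
    return f_min + fraction * (f_max - f_min)


def select_stored_notch(elevation_deg, bank):
    """Return the stored entry whose elevation is nearest to the request."""
    nearest = min(bank, key=lambda stored: abs(stored - elevation_deg))
    return bank[nearest]


# Precomputed bank: elevation in degrees -> notch frequency in Hz.
bank = {e: notch_frequency_hz(e) for e in (-45.0, 0.0, 45.0)}
selected = select_stored_notch(10.0, bank)
print(selected)  # 9000.0
```

Selecting from a precomputed bank avoids recalculating filter coefficients while the virtual source moves; a finer grid of stored elevations would reduce the step between neighbouring entries.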

It should be noted that the above-mentioned embodiments illustrate rather than
limit the invention and that those skilled in the art are able to design alternatives, without
departing from the scope of the claims. Apart from combinations of elements of the invention
as combined in the claims, other combinations of the elements within the scope of the
invention as perceived by one skilled in the art are covered by the invention. Any
combination of elements can be realized in a single dedicated element. Any reference sign
between parentheses in the claim is not intended for limiting the claim. The word
"comprising" does not exclude the presence of elements or aspects not listed in a claim. The
word "a" or "an" preceding an element does not exclude the presence of a plurality of such
elements.

The invention can be implemented by means of hardware or by means of 
software running on a computer, and previously stored on a data carrier or transmitted over a 
signal transmission system. 
CLAIMS:



1. A sound reproduction system comprising:

- an audio processor with a filter arranged for applying a first head related
transfer function over a predetermined first frequency range to an input audio signal from an
audio signal source, and yielding an output audio signal for a sound production means; and

- a first data source, which is arranged for delivering first filter coefficients of
the first head related transfer function to the filter,

characterized in that

- a second data source is comprised, which is arranged for delivering second
filter coefficients of a second head related transfer function over a predetermined second
frequency range, unequal to the first frequency range, to the filter for filtering the input audio
signal yielding the output audio signal.

2. A sound reproduction system as claimed in claim 1, characterized in that
- a microphone is included for performing a sound measurement; and

- the first data source comprises coefficient calculation means for calculating the
first filter coefficients from the sound measurement; and

- the second data source comprises a memory for storing data related to the
second head related transfer function filter.

3. A sound reproduction system as claimed in claim 2, characterized in that

- the second data source comprises calculating means for calculating the second
filter coefficients based on data from the memory.

4. A computer program for execution by a processor, describing a method of
sound reproduction comprising the steps:

- obtaining coefficients of a first head related transfer function from a first data
source;

- applying a first head related transfer function filtering to an input audio signal
from an audio signal source, yielding an output audio signal,

characterized in that the computer program comprises further steps in its
method of:

- obtaining coefficients of a second head related transfer function from a second
data source; and

- applying a second head related transfer function filtering to an input audio
signal from an audio signal source, yielding an output audio signal.



5. A data carrier storing a computer program for execution by a processor,
describing a method of sound reproduction comprising the steps:

- obtaining coefficients of a first head related transfer function from a first data
source;

- applying a first head related transfer function filtering to an input audio signal
from an audio signal source, yielding an output audio signal,

characterized in that the data carrier stores further steps of the method:

- obtaining coefficients of a second head related transfer function from a second
data source; and

- applying a second head related transfer function filtering to an input audio
signal from an audio signal source, yielding an output audio signal.

6. A data carrier storing a first head related transfer function over a first
predetermined frequency range, characterized in that

- also a second head related transfer function over a second predetermined
frequency range is stored, and

- the second head related transfer function comprises complementary
information, improving the simulation of sound from a loudspeaker by means of sound
production means.

7. A signal transmission system transmitting a first head related transfer function
over a first predetermined frequency range, characterized in that

- also a second head related transfer function over a second predetermined
frequency range is transmitted, and

- the second head related transfer function comprises complementary
information, improving the simulation of sound from a loudspeaker by means of sound
production means.
The sound reproduction system comprises a sound production means (101),
e.g. a headphone loudspeaker, and an audio processor (105) capable of applying a first head
related transfer function (200) to an input audio signal (106) from an audio signal source
(110) and capable of delivering an output audio signal (108) to the sound production means
(101). The audio processor (105) is arranged to apply the first head related transfer function
(200) over a first frequency range and a second head related transfer function (202, 204) over
a second frequency range. The second head related transfer function (202, 204) may be
modeled by calculating means (152) or selected from a memory (150).